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Determining the quantum circuit complexity of a unitary operation is closely related to the prob- 
lem of finding minimal length paths in a particular curved geometry [Nielsen et al, Science 311, 
1133-1135 (2006)]. This paper investigates many of the basic geometric objects associated to this 
space, including the Levi-Civita connection, the geodesic equation, the curvature, and the Jacobi 
equation. We show that the optimal Hamiltonian evolution for synthesis of a desired unitary nec- 
essarily obeys a simple universal geodesic equation. As a consequence, once the initial value of 
the Hamiltonian is set, subsequent changes to the Hamiltonian are completely determined by the 
geodesic equation. We develop many analytic solutions to the geodesic equation, and a set of in- 
variants that completely determine the geodesies. We investigate the problem of finding minimal 
geodesies through a desired unitary, U, and develop a procedure which allows us to deform the 
(known) geodesies of a simple and well understood metric to the geodesies of the metric of interest 
in quantum computation. This deformation procedure is illustrated using some three-qubit nu- 
merical examples. We study the computational complexity of evaluating distances on Riemmanian 
manifolds, and show that no efficient classical algorithm for this problem exists, subject to the 
assumption that good pseudorandom generators exist. Finally, we develop a canonical extension 
procedure for unitary operations which allows ancilla qubits to be incorporated into the geometric 
approach to quantum computing. 



I. INTRODUCTION 

A central problem of quantum computation is to find 
efficient quantum circuits to synthesize desired unitary 
operations. These unitary operations are used to solve 
computational problems such as factoring [l|, 0, H[ . De- 
spite intensive effort, few general principles are known 
either for finding efficient quantum circuits or for prov- 
ing that a given computational problem has no efficient 
circuit. 

A geometric approach to quantum circuit complexity 
has recently been developed in 0, H, @ ■ The idea is to 
introduce a Ricmannian metric on the space of n-qubit 
unitary operations, chosen in such a way that the met- 
ric distance d{I, U) between the identity operation and 
a desired unitary U is equivalent (modulo some techni- 
cal caveats, discussed below) to the number of quantum 
gates required to synthesize U . Thus the distance d(I, U) 
is a good measure of the difficulty of synthesizing U. 

This geometric reformulation suggests that the tools of 
Riemannian geometry may be useful in analyzing quan- 
tum circuit complexity. The purpose of this paper is to 
develop in detail many of the basic geometric notions 
that can be associated to quantum computation, includ- 
ing the Levi-Civita connection, geodesies, geodesic in- 
variants, curvature, Jacobi fields and conjugate points. 
We also discuss obstacles to the use of geometric ideas to 
analyze quantum circuit complexity. 

Structure and content of the paper: Section [TTJ 
reviews the relevant background results connectingge- 
ometry and quantum circuit complexity, from 0, lalfj] • 
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Section HTT1 derives simple formulae for, and several ba- 
sic properties of, the Levi-Civita connection on our Rie- 
mannian manifold. With these results in hand all other 
geometric quantities computed later in the paper may 
be obtained through relatively straightforward computa- 
tions. In particular, Appendix lAlcomputes all the natural 
curvature quantities for the manifold, including the cur- 
vature tensor, the sectional curvature, the Ricci tensor, 
and the scalar curvature. 

Section HVl uses the connection to derive the geodesic 
equation. This is a simple and (we believe) rather ele- 
gant equation that determines the locally optimal Hamil- 
tonian evolution for synthesis of any desired unitary. We 
obtain a complete set of constants of the motion for the 
geodesic equation, as well as many simple exact solu- 
tions, including a completely general exact solution for 
three qubits. 

Section [V] develops a procedure for numerically finding 
geodesies passing through a desired endpoint, i.e., a goal 
unitary. We begin with a review of the theory of Jacobi 
fields and conjugate points, which make use of the curva- 
ture to study the divergence or convergence of geodesies 
on a manifold. These tools can be used to study when a 
geodesic is no longer globally minimizing, but is merely a 
local minimum, and we briefly digress to investigate this 
phenomenon numerically for a class of unitaries associ- 
ated with the transverse Ising model. Returning to the 
main point of the section, finding geodesies to a goal uni- 
tary, the basic idea is to smoothly deform the geodesies 
of a simple and well understood metric to the geodesies 
of the metric of computational interest. This is done us- 
ing a notion of a geodesic derivative along a flow through 
the space of metrics. The idea is to fix the geodesic end- 
point, and then the geodesic derivative describes how the 
initial velocity of the geodesic must change as the metric 
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is changed, in order that the deformed geodesic passes 
through the same endpoint. We develop necessary and 
sufficient conditions for the geodesic derivative to exist, 
and a formula for it when it does exist. These necessary 
and sufficient conditions are naturally expressed in terms 
of the conjugate points studied earlier in this section. 
We then use this deformation procedure to numerically 
find geodesies passing through some unitary operations 
of interest, including randomly chosen unitaries, and the 
quantum Fourier transform. 

In Section|Vl]we discuss the general prospects for using 
geometric ideas to analyze quantum circuit complexity. 
We describe two important obstacles to using geomet- 
ric ideas to prove nontrivial upper or lower bounds on 
quantum circuit complexity, and prove two related tech- 
nical results. The first obstacle is the Razborov-Rudich 
theorem, a well-known result in classical computational 
complexity. This is essentially a no-go theorem that, sub- 
ject to certain assumptions, rules out a wide class of ap- 
proaches to proving circuit lower bounds. We outline 
a quantum analogue of the Razborov-Rudich theorem, 
and use it to argue that the general problem of finding 
geodesies on a Riemannian manifold is likely to have no 
efficient (classical) solution. This suggests that proving 
complexity lower bounds using geometric techniques will 
require us to use non-generic properties of specific uni- 
taries. The second obstacle discussed in this section is 
an apparent shortcoming in prior work on the geometric 
approach to quantum computing, which is that it was 
developed for the analysis of quantum circuits which do 
not make use of ancillary working qubits. We show how 
to avoid this restriction by using a canonical extension 
procedure for unitary operations which allows ancillas to 
be incorporated into the geometric point of view. This 
canonical extension procedure may be of independent in- 
terest. 

Background: We assume throughout that the reader 
is familiar with quantum circuits at the level of, e.g., 
Chapter 4 of @ , and with elementary Riemannian geom- 
etry, at the level of, e.g., @, Q. In particular, we assume 
a working knowledge of notions such as tensor fields, the 
Levi-Civita connection, the geodesic equation, and the 
curvature tensor. Much of our presentation is concerned 
with properties of a special type of Riemannian man- 
ifold, known as a right-invariant manifold. However, 
the results we need about right-invariant Riemannian 
manifolds are not easily accessible in a single (or even 
a few) publications, so far as we are aware. Therefore, 
to make the paper accessible, we have developed in a 
self-contained way the main results about right-invariant 
manifolds. The reader curious to investigate the litera- 
ture further should be warned that both left- and right- 
invariant manifolds are widely studied, but differ only 
trivially, and results about one can always be transformed 
into results about the other, sometimes with changes of 
sign. 

Prior work: Geometric techniques have been used 
previously in the study of quantum information process- 



ing. In particular, Khaneja et al 0, [T(| [H| ( see a ^ so 
related ideas in [121 ]) have used powerful techniques from 
the theory of symmetric spaces to study time-optimal 
control. This has been extremely successful in the two- 
qubit case, leading to an essentially complete character- 
ization of two-qubit time optimal control. In the many- 
qubit scenario some successes have been achieved, but the 
need for the rather specialized symmetric space structure 
limits the breadth of possible applications. Related ideas 
have also been investigated by Carlini et al [HI, [l4j] , who 
developed variational principles for time-optimal synthe- 
sis of quantum states and of unitary transformations. 
Time-optimal quantum control of unitary operations has 
a long history; in addition to the above references, we re- 
fer the reader to [l5|, [l(| for recent work, and references 
to earlier literature. 

Perspective: The geometric approach to quantum 
computation is in its infancy, and its long-term merits 
remain to be determined. This and earlier papers do not 
yet offer a killer application of geometric ideas, not avail- 
able through conventional circuit analysis. However, we 
believe that there are reasons to hope that the geometric 
viewpoint will eventually enable insights difficult to ob- 
tain in the conventional circuit picture. In particular, by 
recasting the problem of quantum circuit complexity in 
terms of smooth mathematical objects, we bring the prin- 
ciples of the calculus of variations into play. This allows 
us to derive a geodesic equation describing the locally op- 
timal evolutions generating a desired unitary; intuitively, 
this is to fall freely along a minimal geodesic, with the 
motion determined entirely by the local geometry of the 
space, and the initial direction of motion. This is in con- 
trast to the circuit picture, where no such principles are 
available to derive the locally optimal way to construct a 
circuit. The present paper develops a fairly complete pic- 
ture of the basic geometry of quantum computation, and 
lays the foundation for a more detailed understanding. 



II. QUANTUM CIRCUIT COMPLEXITY AND 
GEOMETRY 

In this section we review the connections between Rie- 
mannian geometry and quantum circuit complexity, as 
developed in 0, la, El ■ 

We begin by recalling a few basic ideas from Rieman- 
nian geometry. We will actually consider Riemannian 
metrics on two slightly different manifolds, the group 
M = U (2 n ) of n-qubit unitary operations, and the group 
M = SU(2 n ) of n-qubit unitaries with unit determinant. 
For the most part the development is identical for the two 
cases, and we will not explicitly delineate them. However, 
there are a few cases where it is advantageous to use one 
or the other, and we mention explicitly when this is the 
case. For definiteness, you may assume that we are work- 
ing with SU(2 n ) unless otherwise specified. 

A tangent vector to a point on this manifold (i.e., an n- 
qubit unitary, U, with unit determinant) can be thought 
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of as a traceless Hamiltonian, H, i.e., an element of the 
Lie algebra su(2 n ) of traceless 2™ x 2™ Hermitian matri- 
ces. More precisely, we can identify a tangent vector at 
U with the tangent to the curve e~ lHt U at t = 0. We 
shall call H the Hamiltonian representation of this tan- 
gent vector 1 . With these identifications, the Riemannian 
metric {-,-)u at & point U is a positive-definite bilinear 
form (H, J) u defined on traceless Hamiltonians H and 
J. Through most of this paper we assume that this bi- 
linear form is constant as a function of U and so write 
(■,-)u = (•)•). A metric which is constant in this way is 
known as a right-invariant metric. 

Suppose U(t) is a curve in SU{2 n ) generated by the 
Hamiltonian H(t) according to the Schrodingcr equation 
U = —iHU. Then the length of that curve is given by 



dt{H{t),H{t)) 1 ' 2 . 



(1) 



The distance d(U, V) between points U and V in SU(2 n ) 
is defined to be the minimal length of any curve joining 
those two points. 

The metric of most interest in this paper is defined as 
follows. Let V be the subspace of n-qubit Hamiltonians 
which contain only f- and 2-body terms, that is, their 
Pauli operator expansion contains only terms of weight 
at most two, e.g., X ® I n ~ x ,I ® X ® Y <g) I n ' 3 . Let Q 
denote the complementary subspace of n-qubit Hamil- 
tonians containing only 3- and more-body terms. Ob- 
serve that su(2 n ) — V + Q, i.e., any Hamiltonian can be 
(uniquely) decomposed as a sum H = Hp + Hq of one- 
and two-body terms, Hp, and of three- and more-body 
terms, Hq. Overloading notation, we now define maps 
V and Q by V(H) = H P and Q(H) = Hq. We define 
a right-invariant Riemannian metric which we call the 
standard metric by: 

ti(HP(J))+qtr(HQ(J)) 



(H,J) 



(2) 



The term q is a penalty parameter 2 , which is chosen to 
be sufficiently large (q > 4™ can be shown to work). The 
intuition behind this choice of q can be understood by 
viewing the length yj (H(t), H(t)) as a cost for applying 
the Hamiltonian H(t), and so choosing q exponentially 
large imposes a very large cost for the direct application 
of three- and more-qubit gates. 

The metric of Equation ([2]) induces a distance d(-, •) on 
SU(2 n ), as described above. This distance function may 
be related to quantum gate complexity by the following 
inequalities, proved in [J, |5[: 

w^!i<^ )SG(co . (3) 



1 In an earlier paper [j| the Pauli expansion coefficients of H were 
referred to as local co-ordinates for the tangent vector. We shall 
not use this terminology here. 

2 Note that our definition and notation for the penalty parameter 
has changed slightly from 0, @, @] • 



In the first inequality, G(U, e) is the approximate gate 
complexity G(U, e) of U, defined to be the minimal num- 
ber of one- and two-qubit gates required to synthesize 
some n-qubit unitary operation V such that \\U— V\\ < e, 
where || • || is the usual matrix norm, and no ancilla qubits 
are used in the computation. The values bo,bi, 62 and 63 
are positive constants. The papers 0, @ proved this re- 
sult for bi = 1/3, 62 = 2/3 and 63 = 2; a value for 60 was 
not calculated explicitly, and won't be needed in this pa- 
per in any case. Examination of 0, [f| reveals that it 
is certainly possible to obtain stronger bounds on these 
constants, but a tight analysis remains to be done. In the 
second inequality in Equation ([3]), G(U) is the exact gate 
complexity of U, i.e., the minimal number of one- and 
two-qubit gates required to synthesize U exactly, with 
no ancilla qubits used in the synthesis. Thus, these two 
inequalities may be summarized by saying that the dis- 
tance d(I, U) gives us both a lower bound on the exact 
gate complexity, and an upper bound on the approximate 
gate complexity of synthesizing U. 

Equation ([3]) can be generalized by using universal 
families of Hamiltonians other than the one- and two- 
qubit Hamiltonians. The idea is to choose a subspace V 
of n-qubit Hamiltonians which has only polynomial (in 
n) dimension, and which is universal for computation. 
We choose a subspace Q' such that su(2 n ) = V' + Q! . 
Overloading notation as before, we define a new metric, 

{H jy s tT(HV'(J))+qtT(HQ'(J)) (4) 

It can be shown using the techniques of [j, @ that pro- 
vided q is sufficiently large, then 



P o\y(G'(U,e),n,e) < d'(I,U) <G'(U), 



(5) 



where poly is some polynomial (with constant but pos- 
sibly non- integer powers), and the primes indicate the 
change in choice of universal gates and of metric. More 
generally, we shall say that any metric related to gate 
complexity by a relationship of the form of Equation ([5]) 
is a computational metric. 

The metric of most interest to us in the remainder of 
the paper is the standard metric of Equation @ . How- 
ever, many of the results we prove will hold in more gen- 
erality, for projective metrics, by which we mean a metric 
of the form of Equation ^ , or even more generally for 
right-invariant metrics. 



III. THE CONNECTION 

In this section we derive several explicit formulae for 
the Levi-Civita connection, which is the basic geometric 
object on the manifold. These results are standard in the 
literature on right-invariant Riemannian manifolds (see, 
e.g., Appendix 2 in [13] for an introduction), although 
our derivation is from a slightly unusual point of view, 
being based on the background in differential geometry 
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most common to physicists. Many of the same results can 
be derived from a more abstract point of view by starting 
with the general formula for the Levi-Civita connection 
(e.g., Equation (5.1) on page 69 of ||). We derive these 
results here in part for completeness, and also because 
it provides an opportunity to introduce many items of 
notation and nomenclature used in subsequent sections. 
Note that Appendix [X] uses these results to compute all 
the natural curvature quantities associated to our metric. 

If Y and Z are vector fields on a manifold, then in a 
fixed co-ordinate system {x k } the connection is given by: 



dz 



kj 



(6) 



where y k ,z k are natural co-ordinate representations for 
the vector fields Y and Z with respect to the co-ordinate 
system {x k }, summation over repeated indices is implied, 
and T J k[ are the Christoffel coefficients. Expressing the 
metric tensor gjk with respect to the same system of co- 
ordinates we have: 



1 kl 



l Jm ^mkl — (g m k,l + 9ml,k — 9kl,m) > (7) 



where a subscript j denotes a partial derivative with re- 
spect to x\. 

We have defined our metric in Equation |2]) using a rep- 
resentation for tangent vectors which identifies a Hamil- 
tonian H with the tangent to e~ tHt U at t = 0. It is nat- 
ural to hope that there is a co-ordinate representation for 
this Hamiltonian (e.g., the Pauli expansion coefficients) 
which can be identified in a natural way with a set of 
co-ordinates such as the Xj above. Unfortunately, such 
a representation does not exist. Instead, to evaluate the 
Christoffel symbol we must first introduce a fixed system 
of co-ordinates on the manifold, which we shall call Pauli 
co-ordinates. We then re-express the metric in terms of 
these Pauli co-ordinates, and use this re-expression to 
evaluate the Christoffel coefficients. This can then be 
used to obtain co-ordinate independent representations 
for the connection and other geometric objects. 

Pauli co-ordinates: To evaluate the connection it 
is sufficient to introduce co-ordinates defined in a neigh- 
bourhood of the origin, / 6 SU(2 n ). This enables us 
to evaluate the connection at the origin, and the right- 
invariance of the metric then gives a general expression 
for the connection everywhere on the manifold. We define 
Pauli co-ordinates by representing U in a neighbourhood 



of the origin as U 



-iX 



Such an X can be defined 



in a unique way using the standard branch of the loga- 
rithm. We can associate a co-ordinate vector x to X via 
X = x-a, i.e., the vector x consists of the Pauli expansion 
coefficients of X. Note that x a = tr(Xcr) /2™. We call the 
matrix X the Pauli representation for U, and the com- 
ponents x a the Pauli co-ordinates. We shall also use this 
terminology for the natural corresponding co-ordinates 
on the tangent spaces, and on the tangent bundle. 

We have defined the metric in terms of a Hamiltonian 
representation for tangent vectors. Our goal now is to re- 



express the metric in terms of the Pauli representation. 
Suppose we are at a point U = e~ lX on the manifold, and 
that a tangent vector is specified there by the Hamilto- 
nian H . What is the corresponding representation of the 
tangent vector in the Pauli co-ordinate representation for 
the tangent space T v SU{2 n )l 

To answer this question, we rewrite the tangent curve 



-iHt e -iX 
-i(X+t.T) 



-iHt^-iX 



corresponding to H in the form e 
- 0(t 2 ). The Baker-Campbell-Hausdorff for- 



mula gives us: 



H = £ X {J) = iad* 1 (e- iad - - j) (J), 



(8) 



where adx(^) = [X, Y], [•, •] denotes the matrix commu- 
tator, and 1{Y) = Y . This formula relates the Hamil- 
tonian representation, of the tangent vector, to the 
Pauli representation, J, of the same tangent vector. 

It can be shown that Ex is invertible near the origin, 
i.e., for X sufficiently close to 0. We denote the inverse 
by T>x = Ex 1 > so we nave J = T>x{H). Note that Ex 
has the power series expansion 



£ 



x 



(-iad x ) J 



(9) 



To compute the connection at the origin it suffices to 
have expansions of Ex and T> x to first order in X: 



Ey 



_ «adx „,„ 9 , „ ^ i&dx 



0{X 2 



(10) 



Finally, it is helpful to note that the adjoint of these 
superoperators with respect to the trace inner product 
(X,Y) = tr(Xty) satisfies E x = £- X and V x = V_ X - 

The metric in Pauli co-ordinates: In SectionHIlwc 
defined the metric with respect to the Hamiltonian repre- 
sentation. We now rewrite the metric in a neighbourhood 
of the origin with respect to the Pauli co-ordinates. This 
procedure can be carried out for a general right-invariant 
metric with essentially no extra effort beyond what is re- 
quired for the standard metric. The most general form 
for a right-invariant metric in the Hamiltonian represen- 
tation is: 



(H,J) = 



tv(HG(J)) 



(11) 



where Q is a strictly positive (and thus self-adjoint) su- 
peroperator, i.e., a linear operator on matrices such that 
tc(HQ(H)) > whenever H ^ 0. For the standard met- 
ric, the explicit form of Q is Q = V + qQ. It will be conve- 
nient for later use to define a Hermitian matrix L = Q{H) 
dual to the Hamiltonian, H . Note that the dual satisfies 
(H, J) = tr(LJ)/2" for all J. 

Suppose Y and Z are tangent vector fields defined in 
a neighbourhood of the origin. We will also use Y and 
Z to denote the specific values these vector fields take at 



a point U 



-iX 



in that neighbourhood. Let Y , Z 
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denote the Pauli representation for these vectors, and 
Y H = £ X {Y P ) and Z H = £ X {Z P ) denote the corre- 
sponding Hamiltonian representation. The metric then 
is given by: 



(Y,Z) = 



ti{Y H Q{Z H )) 
2™ 

tr(£ x (Y p )go£ x (Z p )) 
2" 

tr{Y p £ ] x ogo£ x {Z p )) 
2™ 



(12) 
(13) 
(14) 



Define Q x = £ x oQo£ x , so (Y, Z) = tv{Y p g x {Z p ))/2 n . 
With respect to the Pauli co-ordinates y a and z a we 

have Y p — V a<J an d Z p = J2 a z(7(T ' an( ^ thus m this 
co-ordinate representation the metric tensor has compo- 
nents 



9a, 



The inverse g aT is given by 

fTT _ tr(^x(r)) 



(15) 



(16) 



where f x e^ 1 =P x of o and J 7 = g -1 . To 
compute first derivatives, we note that 



Qx = G + -l&d x ,g} + 0(X 2 ). 



(17) 



Using the cyclic property of trace, and the fact = g, 
a computation shows that at the origin 



itr(([g(<r),T] + [g(T),q])/j) 

2 «+i 



(18) 



Other equivalent forms are possible; this form seems to 
us to be particularly easy to recall. 

The Christoffel symbol: The Christoffel symbol 
rg T = g pp T liaT may be computed at the origin by ob- 
serving that 



(19) 



= —tr M[a,g(T)] + [T,g(a)])). (20) 
Using g pfi — tr(J 7 (p)fi)/2 n we obtain 

Kr = ^tr (HP)(1°, Sir)] + [r, Q{a)])) . (21) 

It is worth noting that this formula holds in considerable 
generality. The only critical property of the Pauli matri- 
ces used in the derivation is that they are orthonormal 
(up to a constant factor) with respect to the trace inner 
product. 

The connection: Working in the Pauli representation 
and using Equations ijSJ) and (f2"Tj) . we have at the origin: 



(v Y zf = y °z p + -t ([r p , g(z p )} + [z p , g(Y p )]) 



(22) 



This equation gives a formula for the connection eval- 
uated at the origin, when the vector fields are written 
in the Pauli representation. This can be re-expressed 
in the Hamiltonian representation by observing that at 
the origin (V Y Z) H = (V Y Z) P , Y k = Y p (and thus 
the y a components are the same in both representa- 
tions), and Z H = Z p . Finally, we have Z p = V X (Z H ) 
near the origin, and thus using Equation (|10|) we obtain 
Z p = (i/2)[a,Z H ] + Z H a at the origin. This gives: 



(VyZf = y°Z H a + l -(lY H ,Z H ] 



-T{[Y H ,g(z H )] + [z H ,g(Y H ) 



(23) 



Note that the partial derivative in Z H a is still with respect 
to the Pauli co-ordinates x a on the manifold. 

Suppose now that we have a curve that passes through 
the origin and that has tangent Y at the origin. Using 
Equation (f23f we see that the covariant derivative along 
the curve D t Z = VyZ is given in the Hamiltonian rep- 
resentation by: 



(D t Z) H 



dZ 



H 



U[Y H ,Z»] 



dt ' 2 

hT([Y H ,g(z H )] + [z H ,g(Y H )])) 



(24) 



Note that because of the right-invariance of the metric 
this equation is true everywhere on the manifold. 

The formula for the connection simplifies in the spe- 
cial case when Z is a right-invariant vector field. In this 
case Z H does not vary as a function of position, and so 
Equation ([23]) gives: 



{VyZ) h 

= l - ([Y H , Z H ] + T {[Y H , g(Z H )} + [Z H , g(Y H )})) . 

(25) 

Once again, because of the right-invariance of the metric 
this equation is true everywhere on the manifold. A con- 
sequence of this equation that will be useful later is that 
{X, \7yZ) = -(V^X, Z) for any triple of right-invariant 
vector fields, X, Y and Z. 

Note that in subsequent sections we work almost en- 
tirely in the Hamiltonian representation. As a con- 
sequence, when applying formulae like Equations (|23[) 
and (|24p we will drop the superscript H 's. 



IV. GEODESICS 

In this section we present the geodesic equation (Sub- 
section IIV Ap , develop a complete set of constants of 
the motion for the geodesic equation (Subsection IIV Bl) , 
and describe several classes of analytic solutions to the 
geodesic equation (Subsection IIV CP . 



6 



A. The geodesic equation 

By definition, a geodesic in SU(2 n ) is a curve U(t) 
whose tangent vector H (t) satisfies the condition D t H = 
0, i.e., the tangent vector is parallel transported along 
the curve. Using Equation ([24)) this becomes 



(26) 



= H + iF([H,Q(H)]). 



This equation is more conveniently rewritten in terms of 
the dual L = Q{H) = ^{H). After a little algebra we 
obtain the geodesic equation in the form we shall most 
commonly apply it, 



L = i[L, F(L)}. 



(27) 



We shall refer to this equation as the geodesic equation, 
to distinguish it from other equivalent forms. A third 
form which is often used in the literature is the form 
(H,J) = i{H, [H, J]), valid for any J e su(2 n ). Note 
that the geodesic equation is a well-known result in the 
literature on right-invariant Riemannian manifolds (see, 
e.g., Appendix 2 of ) ■ Note also that Equation is 
in the class of Lax equations well known to mathemati- 
cians. 

In the special case of the standard metric the geodesic 
equation simplifies nicely. Recalling that Q = V + qQ, 
and thus T = Q^ 1 = V + q~ x Q, we obtain 



fl-g" 1 ) [£,?>(£)]. 



(28) 



Provided g ^ 1 we can remove the dependence on q by 
defining a rescaled version of L, M = (1 — q~ 1 )L, ob- 
taining a form of the geodesic equation independent of q, 
except for the requirement q ^ 1: 



M = i[M,V(M)]. 



(29) 



This is a remarkable equation. Because the minimal 
length path between any two points on a Riemannian 
manifold is a geodesic, we may assume without loss of 
generality that the optimal Hamiltonian H(t) generating 
any unitary operation is determined by a solution M(t) 
to Equation (J2S]) via the rescaling H = g(M)/(l - q^ 1 ). 
Equation (|29j) is thus a single universal equation whose 
solutions determine the paths of minimal length on the 
manifold. This situation is in vivid contrast with how we 
usually think about the standard circuit model of quan- 
tum computing, where H (t) may have arbitrary time de- 
pendence 3 . It is also notable that Equation (f2"B")) is (ar- 
guably) the simplest and most elegant equation involving 
both the Lie group structure, expressed through the Lie 



bracket, and also the map V onto the set of Hamiltonians 
that are regarded as computationally easy to implement. 

As a caveat to this optimistic picture, note that being 
a geodesic is merely a necessary condition, not a suffi- 
cient condition, for a path to be minimal. In particular, 
there may be many geodesies passing from / to a desired 
unitary, U, and not all those geodesies will be globally 
minimizing 4 . The situation is analogous to minimizing 
a function f(x) in conventional calculus: the condition 
f'(x) — is a constraint that must be satisfied by x min- 
imizing f(x), but further analysis is necessary to deter- 
mine if f(x) is a global minimum. Thus finding geodesies 
is only a first step towards the determination of the dis- 
tance d(I, U). 



B. Constants of the motion 

The geodesic equation (Equation (|2"T|) ) on a right- 
invariant Riemannian manifold has a corresponding set 
of constants of the motion which completely determine 
the geodesies [I?} • To see this, observe that for any 
choice of Lq the function L(t) = U(t)L^U{t)^ satisfies 
the geodesic equation, Equation (f27|) . It follows that 
along any geodesic the function U (t)^ L(t)U (t) = Lq is a 
matrix-valued constant of the motion. Furthermore, by 
differentiating the equation L{t) = U(t)LoU(ty we may 
recover the geodesic equation, and thus this set of con- 
stants of the motion completely determines the geodesies 
of the system. 

As an aside, it is possible to derive these constants of 
the motion (and thus the geodesic equation) by observing 
that the metric is invariant under a continuous symme- 
try group, namely, arbitrary right translations of SU{2 n ). 
One may then use Noether's theorem to find the associ- 
ated constants of the motion, which turn out to be pre- 
cisely the matrix elements of U(tY L(t)U(t). This is the 
approach taken to derive the geodesic equation in [l7T |. 

One-body terms are constants of the motion: In 
the special case of the standard metric, it can be shown 
that the coefficients of the one-body terms in the Pauli 
expansion of H(t) are also constants of the motion along 
geodesies. This fact is useful in developing certain ana- 
lytic solutions to the geodesic equation, to be described 
later. To prove that the one-body terms are constant, let 
S(X) map the n-qubit matrix X onto just its one-body 
terms. We see that: 

™& = s(%) (30) 
dt \dt J y ' 

= i{\-q- x )S{\L,V{L)\) (31) 

= i{\-q^)S(\Q(L),V{L)\). (32) 



This appears to be a major advantage of the geometric approach 
over the circuit approach. It comes, however, at a cost. In the ge- 
ometric approach H (f) may include (exponentially small) three- 
and more-body terms, while in the circuit model only one- and 
two-body terms appear in the Hamiltonian. 



4 They are, however, locally minimizing in the sense that any suffi- 
ciently small arc along any geodesic is always a global minimum 
of the length functional (jigf. pp 222-226). 
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The Pauli commutation relations imply that: (1) the 
commutator of Q(L) with the one-body terms in V{V) 
produces only three- and more-body terms, and (2) the 
commutator of Q(L) with the two-body terms in V(L) 
produces only two- and more-body terms. As a result S 
annihilates [Q(L), V(L)], and so S(L) is a constant of the 
motion. 

An alternative proof that the one-body terms are con- 
stants of the motion may be found by applying Noether's 
theorem to the continuous symmetry SU(2 n ) — > SU(2 n ) 
defined hyU—> VUV\ where V is an arbitrary one-qubit 
unitary. 



C. Analytic solutions to the geodesic equation 

We now develop a range of partial and full solutions 
to the geodesic equation. Our results are mostly special- 
ized to projective metrics, and some results are further 
specialized to the standard metric. In IIV C J we develop 
general necessary and sufficient conditions for geodesies 
to be of the form e~ tHt for a constant Hamiltonian H. 
In IIV C 21 we find an exact form for the geodesies of the 
standard metric in the three-qubit case, for the q — > oo 
limit. This form is based on an algebraic structure that 
is also useful in other contexts. Finally, in IIV C 31 we 
develop a formal power series solution to the geodesic 
equation. 



1. Geodesies where the Hamiltonian is constant 

Along certain geodesies the Hamiltonian is constant, 
and thus the geodesic has the form e~ %m . To determine 
when this is the case it suffices to determine when the 
dual L(t) is constant along a geodesic, since the dual is 
related to the Hamiltonian by a fixed invertible transfor- 
mation. From the geodesic equation for projective met- 
rics, L = i(l — V(L)], we see that in the q = 1 
case L is always constant along geodesies. However, the 
case of most interest to us is when q is very large, where 
we see that a necessary and sufficient condition for L to 
be constant is that [Q(L),V(L)} = 0. This is equiva- 
lent to the condition that [Q(H),V(H)} = 0. When this 
condition is satisfied, and only when it is satisfied, the 
geodesic is of the form ex-p(-iHt). 

In the case of the standard metric, we see that this 
condition is that the one- and two-body terms in a 
Hamiltonian should commute with the three- and more- 
body terms. An appealing consequence is that when- 
ever H contains only one- and two-body terms, then 
cxp(-iHt) is a geodesic. Over sufficiently short time 
periods geodesies are guaranteed to be minimal length 
curves (see, e.g., Section 3.3 of Q), and so this result ac- 
cords with the intuition that the fastest way to simulate 
a physical system is with its own evolution. 



2. Geodesies for three qubits 

In the case where there are only three qubits we can 
derive a solution to the geodesic equation for the stan- 
dard metric that is exact in the large q limit. In fact, it 
turns out to be possible to analyze a more general class 
of metrics, defined by the choice 

g = sS + T+ qQ, (33) 

where S maps onto the subspace of three-qubit Hamil- 
tonians which contain only one-body terms, T maps 
onto the subspace containing only two-body terms, and 
Q maps onto the subspace containing only three-body 
terms. In the case s = 1 this reduces to the standard met- 
ric. The limit s — > corresponds to the case where one- 
body Hamiltonians may be applied effectively for free. 

The key observation needed to obtain the geodesies is 
the commutation relations between the matrix subspaces 
S, r and Q, 

[S,T] C T (34) 
[S,Q] C Q (35) 
[T,Q] C T. (36) 

Note that the derivation which follows depends only 
on these commutation relations, and not on the spe- 
cific choice of S as one-body Hamiltonians, etcetera. It 
would be interesting to obtain other examples, outside 
the three-qubit context, where this algebraic structure 
appears naturally. We do not know whether this par- 
ticular structure is ever of computational interest in the 
large n limit. We note that this general approach of us- 
ing algebraic structure to obtain insight into geometry is 
reminiscent of the work of Khaneja, Glaser and Brock- 
ett who make use of symmetric spaces to solve geo- 
metric problems involving two qubits. 

Defining S = S(L), T = T(L), and Q = Q(L), we see 
from the commutation relations of Equations (|34|) - (f36|) 
that the geodesic equation L = i[L,!F(L)] becomes: 

S = (37) 

t = z[(l- s - 1 )^+(l- <z - 1 )Q,T] (38) 

Q = iiq- 1 -s-^Q]. (39) 

To solve these equations, observe that S is a constant 
of the motion. This makes the equation for Q a linear 
equation that is easily solved. The equation for T is then 
a time-dependent linear equation that can be solved using 
standard techniques. The resulting solution is 

S(t) = So (40) 
T(t) = e it ( ( i~ 1 ~ s ~ 1 ') s <>e it< - 1 ~ c i~ 1 ^ So+Qo ' > x 

X oe ~ it ( 1 ~l' 1 )(So+Qo) e -it(q- 1 -s- 1 )S (A]\ 

Q(t) = e it{q ' 1 - s ' 1)So Qoe- it(q ' 1 - s ~ 1)So . (42) 
The corresponding Hamiltonian has the form: 

H(t) = s- 1 S(t)+T(t) + q- 1 Q(t). (43) 
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This expression for the Hamiltonian holds for all q and 
s. We now show how to integrate the corresponding 
Schrodinger equation in the large q limit to obtain the 
geodesic U(t). 

Without loss of generality we can assume that we are 
working on a geodesic with (H(t),H(t)) = 1 for all 
time. As a result we obtain the bounds tr(S' 2 )/2 3 < 
s,tr(T 2 )/2 3 < 1, and tr(Q 2 )/2 3 < q. The term q- x Q{t) 
is therefore of order q" 1 / 2 , and thus may be neglected in 
the large q limit, with a resulting error in U(t) of order 
tq~ x l 2 . For similar reasons, we can neglect the q~ x terms 
in the exponentials appearing in T(t). The resulting er- 
ror in Tit) is at most of order t(s 1 ^ 2 q^ 1 +q~ 1/>2 ), and thus 
the error in U(t) is at most of order i 2 (s 1/,2 <7 _1 + q~ x / 2 ). 
This leads us to define an approximate Hamiltonian 



H(t) = a' 1 So + er lts ^v^So+Qo) x 

rp e -it(S +Qo) gits^ 1 So 



(44) 



The corresponding solution U (t) to the Schrodinger equa- 
tion satisfies 

\\U(t)-U(t)\\ < Oitq- 1 / 2 +t 2 {s 1 / 2 q- 1 + q- 1 / 2 )). (45) 

Making the change of variables V = e - it (So+Qo) e its- 1 s jj 
we see that the Schrodinger equation is equivalent to 



V = -i (So + To + Q ) V. 



(46) 



The approximate solution to the geodesic equation is thus 



U(t) = 



_ -its 1 S Jt(S +Q ) p -it(S +T +Qo) 



(47) 



Although this form is an exact solution to the geodesic 
equation in the q — > oo limit, it is not obvious which 
is the minimal geodesic passing through a particular de- 
sired unitary U. Developing techniques to find minimal 
geodesies in this case is an interesting problem for further 
work. 

The special case s — > is of some interest in our three- 
qubit example, where it corresponds to zero cost for local 
unitary operations. In this limit the 5*o terms in the 
second and third exponentials of Equation (|47|) may be 
neglected, and we obtain the solution: 



U(t) 



So gitQo e -it(T +Q ) 



(48) 



Returning to the case of general s, we now attempt to 
simplify the expression in Equation (|47|) . Generically, we 
expect that 5*0 + Qo is large compared with To, and 5*0 + 
Qo is non-degenerate. First-order perturbation theory 
can be used to simplify the product of the final two terms 
to obtain 



U(t) 



-its 1 S -itTZs +Q {T a ) 



(49) 



where TZs +q (To) denotes the diagonal matrix which re- 
mains when we work in the eigenbasis of So + Qo and 
remove all off-diagonal entries from Tq. Assuming Qo is 
nondegenerate, in the s — > limit we obtain 



U(t) 



l S -itn Qo (T ) 



(50) 



3. Formal solution of the geodesic equation 

In this section we develop a formal power series so- 
lution to the geodesic equation. The formal solution is 
most easily developed for the dual L(t) which satisfies 
the equation L — i[L, F(L)). We expand L(t) in a power- 
series, 



L(t) 



E 

3=0 



(51) 



where L^'(0) is the j'th derivative of L(t) at t = 0. 
This derivative can, in principle, be evaluated using the 
geodesic equation. It is rather inconvenient to do this 
directly. However, it can be done easily using the vector- 
ization technique, whereby matrices are converted into 
vectors, and linear operations taking matrices to matri- 
ces become matrix operations taking vectors to vectors. 
We assume readers are familiar with vectorization (see, 
e.g., Chapter 4 of 0). 

We will write the formal solution for any equation of 
the form L = £(L, L), where £(■, •) is a bilinear operation. 
This class of equations includes the geodesic equation for 
any right-invariant metric. Vectorization of this equation 
yields 



d\L) 
dt 



E(\L)®\L)), 



(52) 



where \L) is the vectorized form of the matrix L, and E 
is the vectorized form of the bilinear operation Z. Note 
that E is a linear operation mapping from the tensor 
product of two copies of the space on which \L) lives 
into a single copy of that space. For the class of opera- 
tions £ arising from the geodesic equation, standard vec- 
torization techniques show that E has the explicit form 
E = iR(I — S)(I <E> F), where F is the vectorized form of 
J 7 , S swaps the factors in the tensor product, and R is 
defined by R(\X) ® \Y)) = \XY), where XY is the usual 
matrix product of X and Y. 

Taking repeated derivatives, it follows that the j'th 
derivative may be written 

\L {l) ) = E{E®I + I®E)... 

(E (g) /^O'- 1 ) + I <g> E ® I®(3- 2 ) + ...) 
\L)^ +1 \ (53) 

where it is understood that \L^) and \L) are evaluated 
at time t = 0. To simplify this expression, observe that: 



E®I® 1 = S 1M1 (E ® I^ k+l ^Tr, 



(54) 



where Si t k swaps systems 1 and k, and acts trivially on all 
other systems, and it is some permutation of the systems. 
It follows that if \X) is a vector in the entire tensor prod- 
uct space such that \X) is symmetric under interchange 
of any of the systems, then we have 

(I® k <g> E®I® l )\X) = 5' 1 , fc+1 (£:®/®( fe+ '))|X), (55) 
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where we used the fact that n\X) — \X) for all permu- 
tations 7r. Define an operator T m acting on m systems 
by 

T m = I + S li2 + ... + S lim , (56) 

where it is understood that each swap Si j acts on m 
systems. Our earlier expression for \L^'), Equation (f53|) . 
may now be rewritten as 

\L U) ) = ET 2 (E ® I)T 3 (E <g> I m ) 

. . .Tj(E ® I®V-V)\L)®U+i) , (57) 

Thus we have the desired formal expression for the vec- 
torized solution \L(t)) to the equation L = £(L, L). 

3=0 3 ' 

V. GEODESIC DEFORMATION AND 
CONJUGATE POINTS 

A central problem in developing our geometric ap- 
proach to quantum computation is to find a minimal 
geodesic from the identity / to a specified unitary U . In 
this section we develop two sets of tools that can be used 
to make progress towards the solution of this problem, 
and illustrate these tools through numerical examples. 

The first set of tools are known as Jacobi fields and 
conjugate points. They are standard tools in Rieman- 
nian geometry, and relate the global problem of deter- 
mining when a geodesic is minimizing to local curvature 
properties of the manifold. We will use these tools to 
give explicit examples of geodesies which are provably 
not minimizing, i.e., they can be used to find examples 
of curves which are local length minima, but which are 
not global length minima. For example, for geodesies of 
the form exp(— iHt), with H containing only one- and 
two-body terms, we use conjugate points to find values 
of t beyond which these geodesies are not minimal. 

The second set of tools is aimed at solving the geodesic 
equation with fixed endpoints I and U. Traditional meth- 
ods for solving this two-point boundary value problem 
(e.g., shooting methods) do not work so well, since the 
space we are working in has extremely high dimensional- 
ity. The idea we use is to deform the geodesies from the 
q = 1 case, where the form of the geodesies is well under- 
stood, to much larger values of the penalty, e.g. q = 4™. 
We will show that this deformation can be achieved us- 
ing a generalization of the Jacobi equation, which we call 
the lifted Jacobi equation. The lifted Jacobi equation en- 
ables us to define a notion of geodesic derivative, which is 
a way of deforming the geodesic as the penalty q is var- 
ied, without changing the endpoints. The lifted Jacobi 
equation, the geodesic derivative, and the deformation 
algorithm are all original, so far as we are aware. 

The detailed structure of the section is as follows. We 
begin in Subsection IV Al by deriving the lifted Jacobi 



equation, and obtain as a special case the standard Jacobi 
equation. This is done on a general Riemannian mani- 
fold. In Subsection IV Bl we give explicit forms of these 
equations which are applicable to the standard metric. 
In particular, the lifted Jacobi equation describes how 
geodesies deform as the parameter q is varied in the stan- 
dard metric. Subsection lV Cl uses the conventional Jacobi 
equation to numerically investigate conjugate points, and 
to find examples of geodesies which are provably not min- 
imizing. Subsection IV Dl defines the geodesic derivative, 
and studies its basic properties, including obtaining nec- 
essary and sufficient conditions for the geodesic derivative 
to exist. The geodesic derivative is then applied in Sub- 
section IV El to obtain a numerical procedure for finding 
geodesies between I and a specified goal unitary, U. We 
illustrate this procedure with some numerical examples. 

A. Lifted Jacobi equation 

Suppose 7(i) is a geodesic on a smooth manifold, M, 
with respect to some metric, g, and we smoothly change 
that metric. Intuitively, it seems it should be possible to 
smoothly deform the geodesic curve so that it remains 
a geodesic with respect to the new metric. The lifted 
Jacobi equation provides a way of making this intuition 
rigorous. It generalizes a well-known tool of Riemannian 
geometry known as the Jacobi equation, which describes 
the behaviour of nearby geodesies of a fixed metric. 

We develop the lifted Jacobi equation on a general Rie- 
mannian manifold, M, and specialize later to cases of in- 
terest in the context of quantum computing. We suppose 
g s is a family of metric tensor fields for M parameterized 
by a single real parameter, s, and smooth with respect 
to any fixed co-ordinate system. Define a (°) symmetric 
tensor field g' to be the pointwise derivative of g with 
respect to s at some fixed value of s, say s = 0. 

Our strategy is as follows. Imagine 7(5, t) is a smooth 
family of curves on M such that 7(5, •) is a geodesic with 
respect to the metric g 8 . We call 7(0, •) the base geodesic, 
and define the lifted Jacobi field J(t) G T 7 ( t )M along 
the base geodesic by 

J(t) = 0.7(0, i). (59) 

The lifted Jacobi field is the vector field telling us how 
the base geodesic is locally deformed as s is varied. We 
will show as a consequence of the geodesic property that 
J(t) satisfies the lifted Jacobi equation, which is a second 
order differential equation. Conversely, given a solution 
to the lifted Jacobi equation, it is possible to define a 
corresponding family of deformed geodesies. 

To derive the lifted Jacobi equation we expand 7(A, t) 
in a co-ordinate representation as 

1 (A,t)^ 1 (0,t) + AJ(t) + O(A 2 ). (60) 

By definition 7(A, t) satisfies the geodesic equation asso- 
ciated to the metric g&. Substituting into the geodesic 
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equation, expanding in powers of A, and considering the 
term linear in A gives 



= 



dP 
W 



r 3 

1 kl 



. p3 jr, 



Qjk Q^l 

~~dt~dt 

dj k d'y 1 
dt dt 



r 3 

1 kl 



Qjk Qjl 

~dt~dt 



dT J kl dj k dj l 
~ds~~dt~dt' 



(61) 



The standard Jacobi equation corresponds to the case 
where g s is constant, i.e., to the case where the first four 
terms on the right-hand side of the above equation sum 
to zero. This allows us to rewrite the above equation in 
more geometric terms as 



(Dpy + (R(j,j)j) j 



OS 



0. 



(62) 



The first two terms here are just the standard terms ap- 
pearing in the conventional Jacobi equation, with -y(i) = 
d<y(0,t)/dt and J(t) ee J(0,i). Note that R here is a Q) 
tensor field formed by raising the last index of the Rie- 
mann curvature tensor, and thus has components R- kl . 
A lengthy and tedious but essentially straightforward cal- 
culation can be used to verify that these first two terms 
correspond to the first four terms in Equation (|61|) . 

Equation (|62p can be rewritten in a still more natural 
geometric form. A calculation shows that 



ds 
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jm 



imk:l 



iml:k 



9kh; 



(63) 



where g' mk .i is the standard notation for the covariant 
derivative of the tensor field g 1 . We see from this equa- 
tion that dT 3 kl /ds is a ( 2 ) tensor field. This is a remark- 
able fact, given that T J kl is not a tensor field. (A simple 
alternate proof that dTij/ds is a tensor field may be ob- 
tained by taking the partial derivative with respect to 
s of the standard (non-tensorial) transformation law for 

no- 

Putting it all together, we obtain the lifted Jacobi 
equation 



family j(s,t) of geodesies for g s with J(t) as the corre- 
sponding lifted Jacobi field. 

Solution to the lifted Jacobi equation: The lifted 
Jacobi equation is a linear, inhomogeneous second or- 
der differential equation, and thus it is possible to write 
a solution to the equation in terms of time-ordered inte- 
grals along the geodesic. In co-ordinates the lifted Jacobi 
equation, Equation (|64p . may be written as 



<fj_ 

dt 2 



A dJ 
A— 

dt 



BJ + C = Q, 



(66) 



where A and B are time-dependent matrices, and C is a 
time-dependent vector. This may be rewritten as a first- 



order system by setting Ji 
so 



J, J2 = J, and K 



dK 
~dt 



I 

-B -A 



K - 



(67) 



Let E t denote the propagator describing the solution to 
this equation in the homogeneous case, i.e., when C = 
we have K(t) = E t K(0). This corresponds to the solu- 
tion of the conventional Jacobi equation. Note that E t is 
a time-ordered exponential which may be studied using 
standard techniques. The solution in the inhomogeneous 
case is then 



K(t) = E t K(0)-E t / drE, 





C(r) 



(68) 



This expression shows that the solutions to the lifted Ja- 
cobi equation may be obtained from the propagator E t 
for the conventional Jacobi equation, and an integral in- 
volving an expression C(-) determined by g'. An inter- 
esting special case of the solution arises when we pick 
K(0) = 0, which corresponds to keeping the initial posi- 
tion and tangent vector to the geodesic unchanged, and 
looking to see how the geodesic deforms. We obtain in 
this case 



Kit) 



-E t 



drEZ 





C(r) 



(69) 



(D 2 jy + (R(j,m j + = 0, 



where 



C j = 



I'mk-l 



9ml;k 



9kh: 



i k i l 



(64) 



(65) 



is a vector field that does not depend on the lifted Jacobi 
field P. Note that the terms g' mk -ii k i l an d g' m i-ki k i l 
appearing in C 3 are equal, which may be used to simplify 
the form of CK 

We have shown that given a family of curves 7(s,i) 
such that 7(5, •) is a geodesic of the metric g s , the cor- 
responding lifted Jacobi field J(i) must satisfy the lifted 
Jacobi equation, Equation It is straightforward to 

turn this reasoning around, and argue that for any solu- 
tion J(t) to the lifted Jacobi equation there must exist a 



B. Lifted Jacobi equation for varying penalty 

In the last section we derived the lifted Jacobi equa- 
tion for a general parameterized family of metrics on 
a Riemannian manifold, M. In this section we derive 
and present a formal solution to the lifted Jacobi equa- 
tion for a parameterized family of right-invariant metrics 
such as arise in the context of quantum computation. 
Specifically, we choose the parameterized family of met- 
rics Q q = V + qQ, so that Q' = Q. We are able to obtain 
an explicit solution to the corresponding lifted Jacobi 
equation along geodesies for which the Hamiltonian H is 
constant. 

One way of approaching this task is to begin with the 
lifted Jacobi equation in the form derived in the last sec- 
tion, Equation (|64[) . In fact, for right-invariant metrics 
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on SU (2™) there is a simpler alternate approach. We sup- 
pose H (t) is a Hamiltonian generating a geodesic U{i) for 
the metric Q qi and that there is a nearby G q +A geodesic 
of the form 



U(t) = U{t)e- tAJ(t \ 



(70) 



for some small A. We will write the lifted Jacobi equation 
as a second order differential equation for J(t). To de- 
rive this equation, rather than start from Equation (|64[) . 
which requires converting J(t) into a suitable co-ordinate 
representation, and then computing all the relevant quan- 
tities, it is easiest to rework through the strategy in the 
last section, but working directly in terms of the quantity 
J(t) rather than some co-ordinate representation. 

To do this, we use Schrodinger's equation to deduce 
that the Hamiltonian generating U(t) is, to first order in 
A, 

H(t) = H(t) + AU(t)j(t)U\t) + 0(A 2 ). (71) 

We require that H{t) satisfies the geodesic equation for 
Gq+A- To see what this implies we set L = G{H), where 
G = G + AG' + 0(A 2 ). Substituting into the geodesic 

equation L = i[L,H], and examining the terms linear in 
A, we obtain 



= K + T{i[K,L] + i[H,g{K)} 

+(g'or)(i[L,H]) + i[H,g'(H)])) 



(72) 



where K = UJU^ is the first-order perturbation to the 
Hamiltonian. This equation is an inhomogeneous first 
order differential equation linear in K and thus can be in- 
tegrated using standard techniques, and then integrated 
again to obtain J(t). The conventional Jacobi equation 
corresponds to the case where G 1 = 0, and thus the last 
two terms vanish. Note that we have used no special fea- 
tures of the standard metric in our derivation, and this 
form of the lifted Jacobi equation holds for any right- 
invariant metric. 

The case of constant H : Along geodesies where H is 
constant it is possible to obtain a closed form expression 
for the solutions to the lifted Jacobi equation correspond- 
ing to the standard metric, i.e., with G' = Q- 

To see this, we observe first that the solution to the 
Jacobi and lifted Jacobi equations coincide when H is 
a constant. This is because the inhomogeneous con- 
tribution {Tog' o T){i[L,H\) + T{i[H,g'{H)]) to the 
lifted Jacobi equation vanishes, since [L, H] = and 
[H,g'{H)] = [H, Q{H)j = along a geodesic with con- 
stant H. An interesting consequence is that if we choose 
J(0) = and j(0) = 0, then J{t) = for all time, i.e., 
the geodesic does not deform as the metric is varied. In 
other words, the geodesies for which H is constant are 
the same for all q, as can also be seen from the condition 
derived in Section HV Al 

We may thus vectorize the lifted Jacobi equation, ob- 
taining \K ) = iA\K ), with 

A = F [{I ® L - L T ® I) + {H T ® I - I ® H)G] , 

(73) 



where G and F are the vectorized forms of G and T, 
respectively. The solution to this equation is \K{t)) — 
e iAt \K{0)). Vectorizing j = W KU and substituting 
U = e~ tHt gives 

\j{t))=e lBt e lAt \j{Q)). (74) 

where B = I <E> H — H T <E) I- Integrating we obtain 



\m = i j(o)) 



dr e lBr e iAl 



|J(0)). 



(75) 



This integral can be performed explicitly by using a sec- 
ond level of vectorization, this time acting on matrices 
in the space in which A and B live. We denote this vec- 
torization operation using vec, to distinguish it from the 
map X — > \X), and use unvec to denote the inverse op- 
eration. The integral can now be evaluated to yield the 
explicit solution to the lifted Jacobi equation, 



\J(t)) = |J(0)) 



+unvcc 



e i(A J ®I+I®B)t _ j 

i(A T <Z> I + I ® B 



-vec{I) 



(76) 
|j(0)). 



This expression looks daunting, due to the multiple layers 
of vectorization, but is actually quite simple. 



C. Conjugate points 

In this section we use the theory of conjugate points 
to derive conditions under which geodesies are no 
longer minimizing. In particular, we numerically study 
geodesies of the form e~ lHt , where H is a fixed two-body 
Hamiltonian, and use conjugate points to derive condi- 
tions on t such that the geodesic from / to e~ lHt is only 
a local minimum of the length, not a global minimum. 
This work will also be useful in our later discussion of 
the geodesic derivative. 

Recall the definition of conjugate points from elemen- 
tary Riemannian geometry Q . Two points x and y along 
a geodesic are said to be conjugate if there exists a non- 
zero Jacobi field defined along the geodesic which van- 
ishes at both x and y. If we write the propagator for the 
Jacobi equation in block form as 



E, 



so that the solution is 



Ei E2 
E3 E4 



J{t) 
j{t) 



E, 



J(0) 
j(0) 



(77) 



(78) 



then we see that the points at and t are conjugate along 
the geodesic if and only if E 2 is singular. 

Suppose now that we begin at a point x and move 
along a geodesic. Let t c > be the first time we pass 
through a point y conjugate to x, assuming such a point 
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exists. This point is of particular interest, because it can 
be shown (see [TH, pp 268-270) that past the first con- 
jugate point the geodesic is no longer minimizing. Thus, 
the propagator E t associated with the Jacobi equation 
provides a computational machine which lets us deter- 
mine when geodesies are no longer minimizing . 

To illustrate these ideas we analyze geodesies of the 
form e~ lHt , where if is a sum of one- and two-body 
terms. Intuitively, over short times we expect that the 
fastest way to simulate a physical system is with its 
own evolution. This intuition is confirmed by the fact 
f[l9|. pp 222-226) that over sufficiently short time pe- 
riods geodesies are guaranteed to be globally minimal 
paths. 

However, over longer time periods this is no longer 
the case. A simple illustration is the unitary e~ lZt . For 
t < 7r/2 this can be implemented by applying the Hamil- 
tonian H = Z for a time t; for ir > t > it/2 it is more 
efficient to apply H = —Z for a time ir — t. More gen- 
erally, for any Hamiltonian it is true that e~ lHt « I for 
sufficiently large t, and so sufficiently long geodesies are 
never minimizing. 

The method of conjugate points offers a powerful gen- 
eral way of studying when geodesies are no longer glob- 
ally minimal. In particular, by integrating the Jacobi 
equation it is possible to determine to good accuracy 
when two points are conjugate using computational re- 
sources that scale polynomially with the dimension of 
the underlying manifold, which is 2°W in this instance. 
By contrast, the volume of the space of paths scales as 
2°( 2 ). It is therefore a priori quite remarkable that it 
is possible to prove a geodesic is not minimizing using 
2°(. n ) computational resources. 

In general, finding conjugate points seems to require 
numerical solution of the Jacobi equation, perhaps using 
an explicit solution such as Equation (|76| . valid in some 
special case. In the bi-invariant case, i.e., when q = 1, 
it is possible to write an analytic solution. In particular, 
Equation ([76]) simplifies because A = 0, and it is easily 
verified that conjugate points occur at times 



where m is a non-zero integer, and Xj and Xk are distinct 
eigenvalues of H. 

Of course, the case of computational interest is when 
q ^> 1. As an example of this case we consider the trans- 
verse Ising Hamiltonian in one dimension, 

// X 7 ' 7 ' 1 • h Y, x >- ( 8 °) 

3 3 



5 Note, however, that while the conjugate point condition is suf- 
ficient to say a geodesic is no longer minimizing, it may not be 
necessary — there could already be a (globally) shorter path 
before a conjugate point is encountered. 




t 



FIG. 1: Log plot of the absolute value of the minimum eigen- 
value of E2 (A m in(£ ; 2)) versus time, for the transverse Ising 
model with external field h = 1. The penalty parameter is 
in the regime of computational interest, q — 4™ = 64. Sharp 
dips indicate conjugate points. 

where h is the strength of the applied field. We nu- 
merically investigated conjugate points for the case of 
n = 3 qubits, with external field h = 1, and penalty 
q = 4™ = 64 chosen to be in the regime of computational 
interest. Figure [1] is a log plot of the minimum eigenvalue 
of E2 versus time. The occurrence of sharp dips in this 
plot strongly suggests the presence of a conjugate point. 
We see that in this example the first conjugate point oc- 
curs at t c k 1.54, and thus the geodesic is no longer min- 
imizing past this time. What is remarkable about this 
observation is that we have deduced the non-minimizing 
property without explicitly finding a shorter geodesic. 
We do it instead through the (relatively) computationally 
easy process of studying the conjugate points. It would 
be an interesting challenge to generalize this procedure 
to arbitrary n, perhaps using the known analytic solution 
to the transverse Ising model. 

D. Geodesic derivative 

Suppose 7(t) is a geodesic passing through the point x 
at t = and y at t = T, and we vary the metric while 
holding the endpoints fixed. We show in this section 
that provided x and y aren't conjugate along the geodesic 
7, the geodesic deforms in a unique way that can be 
described by an object we call the geodesic derivative. 

To define the geodesic derivative, suppose j(s, t) is a 
family of geodesies, with 7(5, •) being a geodesic for the 
metric g s , as in Subsection IV Al We suppose j(s, t) has 
the constraints 7(0, t) = 7(f), and 7(s,0) = x,-y(s,T) = 
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y for all s. We define the geodesic derivative as D7 = 
dgdt'f £ T X M at s = and £ = 0. Note that 9 S is defined 
here, since dtj lives in the same vector space, T X M, for 
all values of s. The geodesic derivative thus represents 
the way in which the initial tangent dtj is changing as 
the parameter s is changing near s = 0. 

Theorem 1 Let x = 7(0) and y — 7(T) be endpoints on 
a geodesic "f(t). Then a corresponding geodesic derivative 
Dj exists and is uniquely defined if and only if x and y 
are not conjugate along 7. 

Intuitively, at a conjugate point geodesies in a fixed 
geometry "split" into many nearby geodesies (consider, 
e.g., antipodal points on a sphere). Thus, in one direc- 
tion this theorem is not surprising: we expect conjugate 
points to give rise to many different ways to deform a 
geodesic as the metric is changed. The converse, how- 
ever, is rather less obvious. 

Proof: The existence and uniqueness of such a Dj is 
equivalent to the existence of a family j(s,t) of geodesies 
satisfying the appropriate endpoint conditions, and such 
that the geodesic derivative is the same for any such fam- 
ily. Observe that if such a D7 exists, then = J(0), 
where J(i) is a lifted Jacobi field. Thus Dj exists and 
is unique if and only if the lifted Jacobi equation has a 
unique solution satisfying J(0) = J{T) = 0. Comparing 
with the earlier solution to the lifted Jacobi equation, 
Equation (|68|). we see that this is equivalent to there ex- 
isting a unique D 7 satisfying the constraint: 



£ 2 D 7 = PE t / dr E. 





C(r) 



(81) 



where P projects onto the top block in the block rep- 



resentation K = 



used in the solution of the lifted 

Jacobi equation, Equation ([55)) . Such a unique solution 
Z?7 exists if and only if E2 is invertible. We saw in the 
last section that this is equivalent to x and y not being 
conjugate along "f(t). QED 

Note that our proof shows more generally that any D7 
satisfying Equation ([81]) is a valid geodesic derivative, 
even when the geodesic derivative is not uniquely defined. 
Thus, the analysis of the geodesic derivative is closely tied 
to understanding the kernel of E2. 



E. Numerically finding geodesies 

In this section we explain how the geodesic derivative 
may be used to find geodesies reaching a particular de- 
sired target unitary, U . The procedure used is to begin by 
picking a Hamiltonian H(Q) which generates U at some 
fixed time T along the q = 1 geodesic. This may be 
done by picking H(0) so that U = e~ %H ^ Q ' T ', i.e., by com- 
puting logarithms. We now vary the parameter q in the 
family of metrics Q q = V + qQ, causing a corresponding 
change dH q (0)/dq = in the initial Hamiltonian. Pro- 
vided the geodesic derivative Dj exists and is unique for 



a suitable range of values of 3, we can integrate to ob- 
tain an initial Hamiltonian H q (0) generating a geodesic 
connecting / and U, for any desired value of q. 

To implement this procedure we need to develop a 
method to compute Z3 7 . We could do this using Equa- 
tion (|81[). but in the case of right-invariant metrics on 
SU(2 n ) a more computationally convenient form is pos- 
sible, which we now derive. Recall that D 7 is defined to 
be a value of J(0) such that when J(0) = 0, the solution 
to the lifted Jacobi equation satisfies J(T) = 0, i.e., no 
variation occurs at the endpoint as q is varied. To analyze 
the values of J(0) for which this occurs, we examine the 
solution to the lifted Jacobi equation for right-invariant 
metrics more explicitly. Observe that since J(0) = we 
have 



dtJ(t)= dtU\t)K(t)U(t), (82) 



J(T) 



whe re K (t) = U[t)j[t)U{t)\ as defined in Subsec- 
tion E32 The lifted Jacobi equation, Equation ([72]) . has 
solution 



(83) 



K(t) = fCt{K{0))-]C t ( J drfC-\C(r)) 



where C is the inhomogeneous part of Equation ([72]) . 
and /Ct is the propagator for the homogeneous form of 
Equation (j72|) . i.e., for the standard (not lifted) Jacobi 
equation. The metric derivative along our family is Q' = 
Q, and a calculation shows that 

C = T 2 {i[V{H),Q(H)]). (84) 

Substituting Equation (|53")) into Equation (1521) , we obtain 



J(T) - J T (j(0)) 

dtU{t)^K t (^J^ drK,- 1 (Circuit), 

(85) 

where Jt is the propagator that generates the standard 
(not lifted) Jacobi field J sta nd(? 1 ) = Jr (-/stand (0)), as- 
suming that Jstand(O) = 0. Requiring that J(T) — and 
identifying dH q (Q)/dq = D 7 = J(0), we obtain 



dH q (0) 
dq 



f dtU(tyiCt Qf drK,- l {C(r)) \ U(t) 



This equation can be simplified by observing that 



-Kt I J drJC- L (C(r)) 

it[Q(H),P(H)}, 0=1, 
()C t (L(0))-L(t))/q(q-l), q>l 



(86) 



(87) 
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The top equation can be verified by noting that for q = 1 
K-t is the identity operation for all time and C is constant. 
For the bottom equation it is sufficient to check that both 
sides solve Equation ([72)1 with q > 1 and initial condition 
K(0) = 0. Substituting into Equation ([86)) and using 
W{t)L(t)U(t) = L(0), we obtain 



( J T X (/ T dt U^t)it[V(H), Q{H)]U{t)) , g=l, 
\ (J^\L(Q))T-L(0))/q(q-l), q>l, 

This is our desired expression for the geodesic derivative. 
In practice, we find it more convenient numerically to 
work with the corresponding expression for dL q (0)/dq, 
which is easily obtained from this expression using the 
chain rule. 

Numerical examples: We now illustrate the 
geodesic deformation procedure for two examples. The 
first example is unitary operations chosen at random, and 
the second example is the quantum Fourier transform. 

For the first example, we choose a three-qubit uni- 
tary operation, U , according to the Haar measure. Then 
we define a unique corresponding canonical Hamiltonian, 
-ffcanon, which satisfies U — exp(— iH canon T) and has 
all eigenvalues in the range (—tt/T,tt/T]. We use this 
canonical Hamiltonian as our initial condition, since it 
has the desirable property that the geodesic U(t) = 
exp(— iH canon t) has no conjugate points before t = T, 
and thus is a likely candidate for the shortest geodesic 
through U — U(T) when q = 1. The results obtained 
when we apply the deformation procedure are illustrated 
in Figure [2] Empirically we find that for typical U, if we 
start with the canonical Hamiltonian and deform to large 
values of q we never encounter conjugate points, and so 
the deformation is uniquely defined. This agrees with the 
general intuition that conjugate points are rare. We also 
empirically observe (but have not proved) that the value 
of the dual Hamiltonian L q (0) converges for large q. 

Other choices for the starting Hamiltonian are possible 
by adding multiples of 2tt/T to the eigenvalues of the 
canonical Hamiltonian. Thus the set of possible starting 
Hamiltonians that reach a desired unitary for q = 1 at 
time t — T has the structure of a (displaced) lattice. 
In contrast to the canonical Hamiltonian, however, our 
numerical results indicate that conjugate points at t — 
T do sometimes occur for some of these other starting 
Hamiltonians, and so the deformation procedure is not 
always well defined. 

Somewhat remarkably, in view of this fact, is that our 
procedure still works numerically, even when conjugate 
points appear at t = T. In particular, if we take ad- 
vantage of the fact that numerically the propagator Jt 
is never exactly singular, then it is still possible to in- 
vert, and we can numerically integrate straight through 
the range of values of q where (presumably) a conjugate 
point occurs. Although we do not know how to justify 
this mathematically, we find empirically that our algo- 



rithm still reaches the desired target unitary. It seems 
likely that what is going on is that our numerical pro- 
cedure is picking out one possible way of doing the de- 
formation. In principle, of course, it may be that no 
such deformation exists, but we have not encountered 
any circumstance where this appears to be the case. An 
interesting observation is that in contrast to the canoni- 
cal case, we find that the initial dual Hamiltonian, L q (0), 
tends not to converge for large q, but continues to grow 
in norm. 

In our second example, we generate a geodesic reaching 
the unitary that implements the quantum Fourier trans- 
form on three qubits. Again we start with the canonical 
Hamiltonian for this unitary. The deformation is illus- 
trated in Figure [3j and only shows the deformation up 
to q = 16 in order to highlight the interesting behavior 
around q = 6. For values of q in the range 1 through w 6 
there is a set of Pauli components of L q (0) that remain 
zero. At q « 6 all of these Pauli components suddenly be- 
come non-zero. This phenomena coincides with the prop- 
agator Jt becoming very nearly singular (the magnitude 
of its smallest eigenvalue is approximately 10~ 6 ), and it 
remains nearly singular up to the final value of q = 16 
as illustrated in panel (b). As in the earlier discussion, 
however, we find empirically that applying our deforma- 
tion procedure still appears to generate a valid (though 
non-unique) geodesic derivative, and this is supported by 
the fact that we do indeed obtain valid geodesies to the 
final target unitary, the quantum Fourier transform (see 
inset of panel (b)). 

Finding the minimal geodesic: We have used the 
geodesic deformation procedure to obtain upper bounds 
on the distance d(I, U) for values of q of computational 
interest. Of course, there are many geodesies for any 
given U , not just the geodesies beginning with the canoni- 
cal Hamiltonian H canon , which is the case we have focused 
on. Can the geodesic deformation procedure be used to 
obtain values for the distance d(I, U)l 

An idea for how to do this is as follows. Imagine we are 
trying to determine whether a geodesic of length < n k 
(say) exists for U. It is clear that the length of geodesies 
monotonically increases with q under deformation. At 
q = 1 this allows us to restrict our attention to a finite 
(2°(™ )) set of possible initial values for the Hamilto- 
nian, and study how the corresponding geodesies deform. 
A better understanding of the way conjugate points be- 
have under deformation may enable us to substantially 
narrow this range of choices. 

A number of caveats to this approach need to be noted. 
The first is that for particular choices of unitary, for ex- 
ample the quantum Fourier transform above, the defor- 
mation procedure produces conjugate points at the end- 
points, and so the deformation procedure is not well de- 
fined. A possible way around this difficulty is to consider 
deforming the metric with two or more parameters, in- 
stead of just one. It seems plausible that it may always 
be possible to deform the metric in such a way that new 
conjugate points never appear along the geodesies, the 
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FIG. 2: Geodesic deformation to a randomly-chosen unitary 
on n — 3 qubits. Panel (a) shows how the Pauli components 
of the initial dual Hamiltonian, Z, (0) = tr(crLq (0))/2 n , vary 
with the penalty parameter up to q — 4" = 64. Of the 4™ = 
64 Pauli components, only 16 representatives are shown, for 
clarity, but all converge in the large q limit. Blue lines are 
components where a £ V , red lines are where a € Q. The 
inset shows how the length of the geodesic segment from / to 
U varies with q. Panel (b) shows the minimum eigenvalue of 
the vectorized form of the propagator Jt as a function of time 
along the geodesic found for q = 64. No conjugate points are 
evident. The inset shows the operator norm of the difference 
between the target unitary U, and U(t) along the q — 64 
geodesic, showing that the target is indeed reached at the 
final time T = 1 



FIG. 3: Geodesic deformation to the quantum Fourier trans- 
form on n = 3 qubits. Panel (a) shows how the Pauli compo- 
nents of the initial dual Hamiltonian, l a q (0) = tr(o\Lg (0))/2 n , 
vary with the penalty parameter up to q = 16. The inset 
shows how the length of the geodesic segment from I to U 
varies with q. Blue lines are components where a £ V, red 
lines are where a £ Q. Panel (b) shows the minimum eigen- 
value of the propagator Jt as a function of time along the 
final geodesic found for q = 16. The sharp dip at the final 
time T = 1 indicates a conjugate point. The inset shows the 
operator norm of the difference between the target unitary U, 
and U(t) along the q = 16 geodesic, showing that the target 
is indeed reached at the final time T = 1 
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intuition being that conjugate points are rather rare. 

Secondly, it is possible that as we move to large q new 
geodesies to the target unitary appear that cannot be 
generated as deformations of a q = 1 geodesic. If this 
is the case then it may be difficult to deduce anything 
about the minimal geodesic by deforming. 

Despite these caveats, we emphasize that any geodesic 
between the identity and the unitary gives an upper 
bound on the distance of that unitary from the origin 
and so is potentially interesting. For example, using the 
methods of any geodesic in the large q limit can be 
well-approximated by a sequence of one- and two-qubit 
quantum gates, and it is plausible that gate sequences 
generated in this way may suggest algorithms for comput- 
ing a family of unitaries of which the target unitary is a 
representative (cases of high symmetry would be natural 
candidates). Further study of the mathematical proper- 
ties of the geodesic deformation procedure, including the 
conjectures mentioned above, is needed to provide more 
definitive answers as to its usefulness as a general tech- 
nique for finding minimal curves. It is also desirable to 
compare to other techniques (e.g. [2(|) which can be used 
to find geodesies in spaces of high dimension. 



VI. WHAT CAN GEOMETRY TEACH US 
ABOUT QUANTUM COMPUTATION? 

Motivated by its close connection to quantum gate 
complexity, in this paper we have developed a basic un- 
derstanding of the geometry of the Riemannian metric 
defined by Equation ([2]). However, substantial further 
progress will be required to obtain cither new quantum 
algorithms or to prove limits on computational complex- 
ity. 

In this section we discuss some of the obstacles that 
need to be overcome for this to occur. We begin in Sub- 
section I VI Al with a discussion of the Razborov-Rudich 
theorem, a result from classical computational complex- 
ity that illuminates the difficult of proving lower bounds 
in classical circuit complexity. We describe an analogous 
quantum result, and an interesting corollary, namely, 
that if good classical pseudorandom number generators 
exist, then the problem of determining distances on 
SU(2 n ) according to the standard metric is not (clas- 
sically) soluble in time polynomial in 2". The Razborov- 
Rudich theorem thus poses a considerable barrier to any 
general program for understanding quantum gate com- 
plexity, including our geometric program. 

In Subsection I VI Bl we discuss a second obstacle to the 
use of geometry, namely that the bounds of Equation ([3]) 
apply only for circuits which do not make use of ancillary 
working qubits. While such circuits are of substantial in- 
terest, in general when computing a desired function / or 
unitary U, it may help to introduce extra ancillary work- 
ing qubits. In this Subsection we explain how ancillas can 
be incorporated into the geometric point of view by using 
a canonical extension procedure for unitary operations. 



A. The Razborov-Rudich theorem and the 
computational complexity of finding geodesies 

The Razborov-Rudich theorem [2l[ is a result from 
classical computational complexity theory that poses a 
significant barrier to any general program for under- 
standing gate complexity, either classical or quantum. 
In this section we briefly survey some implications the 
Razborov-Rudich theorem has for the geometric pro- 
gram. The discussion is in the nature of an informal 
outline, since our intent here is merely to outline the 
main ideas, rather than to give the rather extensive for- 
mal definitions which a full discussion would require. In 
general, the full formal details are easy to fill in by ex- 
perts familiar with the Razborov-Rudich theorem. 

In its simplest variant 6 , the Razborov-Rudich theorem 
shows that, loosely, if good pseudorandom generators ex- 
ist, then it is impossible to efficiently distinguish hard- 
and easy-to-compute Boolean functions. 

This statement can be unpacked in three stages. First, 
a "good" pseudorandom generator is here taken in the 
Blum-Micali-Yao sense (see, e.g., Chapter 9 of [HI)- Such 
generators can be show to exist if one-way functions ex- 
ist. So, for example, if the factoring or discrete loga- 
rithm problems arc difficult to solve on a classical com- 
puter, then such generators exist, and the conclusion of 
Razborov-Rudich holds. Second, by hard-to-compute we 
mean a Boolean function whose minimal (non-uniform) 
circuit complexity exceeds some threshold, e.g., n . 
Easy-to-compute means the minimal circuit complexity 
is below that threshold. Note that the threshold can be 
varied somewhat, with the result still holding. Third, by 
an efficient procedure to distinguish hard- and easy-to- 
compute Boolean functions, we mean an efficient classical 
Turing machine which takes as input the truth table for 
the Boolean function, and determines whether it is hard- 
or easy-to-compute. The criterion for efficiency is very 
relaxed: it is that the Turing machine operate in time 
polynomial in the size of the truth table, i.e., in time 
2°(«). 

The idea behind the proof of the Razborov-Rudich 
theorem is easily stated. First, observe following Shan- 
non [23[ (c.f. problem 4.4.14 in (24[) that a randomly 
chosen Boolean function is with high probability hard 
to compute. Second, using a good pseudorandom num- 
ber generator it is possible to construct pseudorandom 
function generators producing Boolean functions which 
appear pseudorandom, but which actually have small 
circuits. Any method for efficiently distinguishing easy- 
from hard-to-compute functions would therefore provide 
an efficient means of distinguishing random functions 
from pseudorandom functions, and this contradicts the 
definition of a pseudorandom generator. As a result, such 



6 See |2l| for full details. Our discussion in this paper is for a very 
simple type of natural proof system, in the language of |21| . 
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a procedure cannot exist. 

It is straightforward to generalize this reasoning to the 
quantum case. In particular, it can be shown that if good 
(classical) pseudorandom generators exist, then there is 
no efficient classical algorithm which can be used to dis- 
tinguish unitary operations that can be synthesized us- 
ing small quantum circuits, and those which require large 
quantum circuits. 

A similar line of reasoning can be applied to the dis- 
tance function (1(1,11). In particular, suppose it were 
possible to efficiently distinguish unitaries U for which 
d(I, U) is large (i.e., exceeds a threshold like n ln ") from 
unitaries for which d(I, U) is small. Such a procedure 
could be used to distinguish a unitary chosen using a 
pseudorandom generator from one chosen truly at ran- 
dom, and this could be used to break the pseudorandom 
generator 7 . It follows that there must be no efficient pro- 
cedure to evaluate the distance function d(I, U) . 

Theorem 2 Suppose classical pseudorandom generators 
exist. Let d(I, U) be the metric on SU(2 n ) induced by 
the standard metric of Equation (0), with q > 4™. Then 
there is no classical algorithm running in time polynomial 
in 2™ and which produces an accurate approximation to 
d(I,U). 

This result is particularly remarkable when one con- 
siders that when q = 1 it is possible to evaluate d(I, U) 
in polynomial time. 

These results are, obviously, rather discouraging. It is 
worth emphasizing that analogous results apply to any 
general approach to quantum circuit complexity, and are 
not special to the geometric approach. In particular, any 
approach to the proof of lower bounds must necessarily 
contend with the Razborov-Rudich theorem. 

Given these results, what is the best approach to find- 
ing unitary operations which can be analyzed using geo- 
metric techniques? We do not know the answer to this 
question. One possibility is to try to use symmetries to 
avoid the obstruction posed by Razborov-Rudich. Sym- 
metries are often used to simplify the analysis of the 
geodesic equation, and may, in some cases, make it pos- 
sible to analyze d(I, U) for those U satisfying the sym- 
metries, without providing a general efficient procedure 
for determining d(I, U). This is currently under investi- 
gation. 

B. Extending the geometric picture to take 
account of ancilla 

A drawback of the results of 0, [U, [f| is that they apply 
only to the synthesis of unitary operations without the 



To make this description a little more precise, suppose we define 
U by U\x) = (— l)f ( x ) \x), where / is a Boolean function that is 
either generated pseudorandomly, or truly at random. 



assistance of ancillary working qubits. We now develop 
a technique enabling the geometric approach to be ap- 
plied to many (not all) unitary operations, even in the 
case of ancilla. In particular, this technique may be ap- 
plied to unitary operations which compute a permutation 
function, |x) — > |/(x)), or which are diagonal in the com- 
putational basis |sc) — > e l0x \x). The technique works by 
showing that quantum circuits using ancilla may be put 
into a standard canonical form which can then be ana- 
lyzed geometrically. 

To make the issue at stake more explicit, suppose we 
wish to synthesize a unitary operation, U, on some num- 
ber, n, of qubits. To do this synthesis it may help to 
introduce m additional ancillary qubits, which start in 
a standard state. Without loss of generality we assume 
this state is the all |0) state, which we denote |0). We 
then attempt to synthesize a unitary operation V such 
that for all n-qubit states, \ip), 

V\i/>)\0) = (Um\A), (89) 

where \A) is some ancilla state. Note that by linearity 
|^4) cannot depend on We call a V satisfying this 
relation an extension of U. Empirically it is found that 
sometimes the gate complexity of synthesizing such an 
extension may be strictly less than the gate complexity 
of synthesizing U without ancilla. 

This situation presents a difficulty for the geometric 
approach, since it suggests that we need to evaluate the 
distance d(I,U), where U is the entire set of extensions 
of U . The set U is rather complex, and it seems likely to 
be far more difficult to evaluate d(I,li) than d(I, U). 

In this section we show how d{I,U) can be accurately 
estimated using distances d(I, U'), for a suitably chosen 
unitary U' = U'(U). This enables us to use distances 
to obtain bounds on the gate complexity of unitary op- 
erations, with ancilla allowed. The constructions we de- 
scribe do not apply for all unitary operations, but they 
do apply for many unitaries of interest, including the uni- 
taries that arise in the computation of classical functions. 

To state our results more formally, let Goo(U) be the 
minimal number of one- and two-qubit gates required to 
synthesize an extension of U, with an unbounded number 
of qubits allowed. We define a special extension of U to 
be an extension V such that the final state of the ancilla 
is the same as the initial state, \A) = 0), i.e., 

VmO} = W))|0). (90) 

We say a special extension is an m-fold special extension 
if the number of ancilla qubits is m. We define G m (U) 
to be minimal exact gate complexity of an m-fold special 
extension of U, and Goo(U) to be the minimal exact gate 
complexity of a special extension of U with an unbounded 
number of ancilla qubits. 

We will show how to obtain bounds on Goo(U) by 
showing that for suitable choice of m there is a single 
rn + 1-fold special extension U m of U such that G(U m ) 
can be used to bound G 00 (J7). Furthermore, we will also 
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show that for many interesting unitaries, including all 
those associated with the evaluation of classical func- 
tions, Goo(U) and Goo(U) behave in essentially the same 
way. 

This allows us to reduce the study of the gate com- 
plexity of U with ancilla to the study of the complexity 
of a fixed unitary, U m , without ancilla. This study can 
then be done through geometric methods, or using any 
other preferred method of analysis. 

The bounds relating Goo(U) to G(U m ) and d(I,U m ) 
go in one direction. Bounds in the other direction, anal- 
ogous to the first inequality in ((3|), may be obtained by 
replacing Goa(U) by an approximate analogue, Goo(i7, e). 

In particular, we define Goo(U,e) to be the minimal 
number of one- and two-qubit gates needed to synthesize 
a unitary operation V such that || V— U\\ < e, where U and 
V are the natural quantum operations on the n-qubit in- 
put space induced by U and V, and ||£|| = max p tr|£(p)|, 
with the maximization over density matrices p; note that 

||V-W||<||tr-v||. 

Theorem 3 There exist positive constants c\ and c 2 
such that for any U and m we can construct an m+l-fold 
special extension U m such that: 

mm{m, Cl G(U m )-c 2 m) < G^U) (91) 
Goo{U,t) < G(U m ,e). (92) 

Proof: Let V be any m-fold special extension of U. 
Consider the circuit: 



(93) 



Note that the first wire represents the n qubits on which 
we desire to implement U, the second wire represents m 
ancilla qubits, and the third wire is a single qubit. Note 
that operations controlled on the second wire are only 
applied if all the qubits in the second wire are set to |0). 

We claim that: (1) this circuit defines an to + 1-fold 
special extension of U; and (2) the action of this exten- 
sion is independent of the choice of special extension, V, 
and is given by the operation U m defined by the circuit: 




Urn — 



(94) 



We call U m the m'th canonical unitary extension of U. 
Note that the fact that U m is an to+ 1-fold unitary exten- 
sion of U follows trivially from the form of this circuit, 
and so the true challenge here is to prove that the ac- 
tion of U m is the same as the action of the circuit in 
Equation (|93| . To verify this it helps to consider sepa- 
rately the cases where y = 0,y ^ and z = 0, z = 1. 
The three cases (i) y = 0, z = 0, (ii) y = 0, z = 1 and 



(iii) y ^ 0, z — 1 all follow from straightforward circuit 
analysis. 

The final case, y ^ 0, z — 0, requires more care. After 
the first gate is applied, the state is (V|x)|?/))|0). The 
critical claim, proved in the next paragraph, is that the 
state V|a;)|y) has zero overlap with any state of the form 
x')|0}. As a result, the second gate has no effect on the 
state of the system, and the third gate merely inverts the 
effect of the first. The final gate has no effect (since y ^ 
0) , and thus the net effect of the circuit is to transform 
|x)|y)|0) to |x)|y}|0), which matches the action of U m . 
This completes the proof. 

To see that V\x)\y) has zero overlap with any state of 
the form |se')|0), observe that \x')\Q) = V(W\x'))\0), and 
thus: 

(x'\(0\V\x)\y) = (x'\U(0\V^V\x)\y) (95) 
= (x'\U\x}{0\y), (96) 

which vanishes since y ^= 0. 

The remainder of the proof of Theorem [3] is relatively 
straightforward. The proof that Goc(U,e) < G(U m ,e) 
follows from the fact that any unitary which approxi- 
mates U m to accuracy e necessarily approximates U to 
accuracy e, using standard arguments about operator 
norms. 

To prove the other inequality, note that without loss 
of generality we can assume that m > Goo(U) (otherwise 
the inequality is trivially true). In this instance observe 
that Goc{U) — G m (U), since it cannot help to have more 
ancilla qubits than gates in a circuit. Let V be the op- 
timal m-fold special extension of U, so G m (U) = G(V). 
Observe that U m can be synthesized using the circuit in 
Equation (JS3J. It follows that G{U m ) < c 1 G(V) + c 2 m = 
c\G m {U) +C2?ti, where the term liner in G(V) is due the 
the controlled-!/ and -V', and the term linear in m is 
due to the multiply controlled operations. Rearranging 
this inequality gives the desired result. QED 

The following corollary follows from the theorem and 
the results of Section ITU 

Corollary 1 There exist positive constants C\ and c 2 
such that for any U and m we can construct an m + 1- 
fold special extension, U m (the canonical extension), such 
that: 

mm(m,cid(I,U m ) - c 2 m) < Goc(t^) (97) 
poly(G oc (C/,e)) < d(I,U m ). (98) 

In order to apply the theorem and corollary, we need to 
find scenarios where Goc(U) and Goo(U) behave in essen- 
tially the same way. We now show that this is the case 
for Boolean functions, / : B n —> B, where B = {0,1} 
is the set of states of a single bit, and B n is the set of 
states of a string of n bits. An extension to more complex 
classical functions may be performed along similar lines. 

We define G oc (f) to be the minimal number of quan- 
tum gates required to exactly compute f(x). That is, it is 
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the minimal number of one- and two-qubit gates required 
to compute a unitary V such that: 



V\x)\0) = \f(x))\A 



(99) 



where \A X ) is some "junk" final state that will be ignored. 
We define G c 00 {f) to be the minimal classical circuit com- 
plexity required to exactly compute /. 

Suppose we define a unitary Uf by Uf\x)\z) = \x)\z © 
/(&)), where addition is done modulo two, and a unitary 
Vf by Vf\x) = (—l)f( x '\x). Then the following theorem 
shows that the quantum circuit complexity of / is es- 
sentially equal to G(Uf) and G(Vf). Thus, Theorem [3] 
and Corollary Q] may be applied to obtain insight into the 
quantum circuit complexity of Boolean functions. 

Theorem 4 

GUf) > GooCf) = e(G oc ([/ / )) = QiG^Vf)) (100) 

Proof: The first inequality is obvious. The first equal- 
ity follows by standard techniques of reversible computa- 
tion [25|,[26[. In brief, note that by definition C?oo(/) < 
Gcx>(Uf) < G oo(U /). Conversely, let V be the unitary of 
minimal gate complexity satisfying Equation (|99p . Then 
by applying V to the first and third register of |x)|.z)|0) 
we obtain \f(x))\z)\A x ). Adding the value of the first 
register to the second and then applying V* we obtain 
\x)\z © /(a;))|0). It follows that G^Uf) < 2G tX) (/) + 1, 
and thus G oo (f) = 0(G oo (f)). 

The second equality follows by standard techniques of 
phase estimation [271 HI]; see e -g-j Section 5.2 of 
QED 

To conclude this section, we give some examples of 
Theorem [3] and its consequences in action. We will focus 
on the behaviour of Goo(U), assuming that we are work- 
ing in a situation like that provided by Theorem [1J e.g., 
with a class of unitaries for which G^U) and Goo(U) 
behave similarly. 

A simple example of the theorem is to suppose that U 
is an n-qubit unitary for which we can prove 



,1(1. V n 2) . ( ^- + 5 j ir 



(101) 



for some 6 > 0. It need not be that d(I,U n 2) actu- 
ally scales quadratically — it would be just as good if 
d(I,U n 2) = 2™, for example. Substituting m = n 2 into 
the theorem, it follows that: 



Goo (E0 > min(l,<5)n 2 = n(n 2 ). 



(102) 



Thus, if Goo(U) ~ Goo(U), then we can prove that the 
number of gates required to synthesize U scales at least 
as tt(n 2 ). 

This conclusion perhaps appears somewhat surprising. 
After all, U n i involves n 2 qubits, and so surely we would 
expect d(I, U n i) to scale as in Equation (|101[) . no matter 
what U is. The resolution is that it is only the excess 



beyond (c2/c\)n 2 that contributes to the bound on the 
gate complexity. Fortunately, there are many situations 
where such an excess is likely to occur. To see this, recall 
from Equation that 



& G(t/,e) 6l e 62 



< d{I,U). 



(103) 



In the papers 0, [f| the constants found were b\ — 
1/3,62 = 2/3 and 63 = 2; a value for 60 was not calcu- 
lated explicitly. It is now straightforward to prove that: 



Proposition 1 

&oGoo([/,e) bl e b2 



(n + m) b3 



< d(I, U„ 



(104) 



Proof: Simply observe that G O0 (Z7, e) < G(U m , e), and 
then apply ifTOg]) . QED 

This proposition shows that rapid scaling in Goo {U, e) 
implies rapid scaling in d(I,U m ). As a result, if U is 
difficult to approximate, then the geometric properties 
imply that U is difficult to compute exactly, even when 
ancilla are allowed. Needless to say, if U is difficult to 
approximate, then it is difficult to compute exactly. The 
significance of the Proposition is that it provides circum- 
stances under which we can guarantee something about 
the behaviour of the geometry. 

As an example, suppose we define a = (1 + b^)/bi, 
and that G^U, 1/10) = fl(n a+s ) for some S > 0. Sup- 
pose we choose 7 > such that 7 < S/a. Then ap- 
plying the proposition we see with a little algebra that 
d(I, U n i+y) — Q(n 1+ P) for some /3 > 7. In such a situa- 
tion, it follows from Theorem [3] that the geometric prop- 
erties imply superlinear lower bounds on the exact gate 
complexity G 00 (J7). 

In a similar vein, if we have Gao(U, 1/10) = f2(2 Cl "), 
and choose a positive value for C2 such that c 2 < c\ja 
then we see from the proposition that d{I,U2^) — 
f2(2 C3 ") for some C3 > C2, and thus by Theorem [3] the 
geometric properties imply exponential lower bounds on 
the exact gate complexity G 00 ([/). 

So, for example, if, as suspected by many people, it 
turns out that NP-hard problems require exponential 
size quantum circuits to approximate, then it will imme- 
diately follow that there are constants < C2 < C3 such 
that d(I,U 2 °2n) = Q(2 nC3 ), and thus, by Theorem [3] the 
geometric properties imply exponential lower bounds on 
the exact gate complexity. 



VII. CONCLUSION 

We have explored the basic geometry of quantum 
computation, including the Levi-Civita connection, the 
geodesic equation and many solutions and invariants of 
the equation, as well as all the basic curvature quan- 
tities. We have also developed a geodesic deformation 
procedure which in many cases of interest allows us to 
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find geodesies connecting the identity / to some desired 
unitary U . This gives a more or less complete picture of 
the basic geometry of quantum computation, and should 
provide a foundation for a more detailed understanding. 
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APPENDIX A: CURVATURE 

In this appendix we derive explicit expressions for the 
various quantities describing curvature. This includes the 
curvature tensor (Subsection IA 1[) . the sectional curva- 
ture (Subsection IA 2p . the Ricci tensor (Subsection I A 3| . 
and the scalar curvature (Subsection I A 4[) . Note that 
many of these quantities are presented for a general right- 
invariant metric in 29]. However, it is helpful to have 
explicit forms of these curvature quantities for the stan- 
dard metric, Equation @, and so we present detailed 
calculations. 



1. Curvature tensor 

In this section we compute the Riemann curva- 
ture tensor, R, which is a (°) tensor field defined by 
R(W,X,Y,Z) ee (V W V X Y - V X V W Y - V mx] Y,Z). 
We work with respect to a basis of right-invariant frame 
fields p, a, t, [i, corresponding to generalized Pauli matri- 
ces, and compute the corresponding components of the 
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curvature tensor, 

R paTp = (V p V ff r- V ct V p t- V iM T,n) (Al) 

= (VpT,V CT ^) - (V CT T,V p ^) - (V l[p . CT ]T,/i), 

(A2) 

where in the second line we used the fact that 
{X, VyZ) = -{VyX, Z) for any triple of right-invariant 
vector fields, X, Y and Z. Using the formula of Equa- 
tion (l25l), we see that 



CT,T 



(A3) 



where 



(B{X,Y),Z) andVxY = \[i[X : Y]-B{X : Y)-B{Y,X)). 
Using these facts and the cyclic property of trace a cal- 
culation shows that 



K{X,Y) = --{i[X,Y],i[X,Y}) 



(B(X, Y) + B(Y, X),B(X, Y) + B(Y, X)) 



[X,Y],B(X,Y)~B(Y,X)). 



(A7) 



For values of q of computational interest (indeed, for any 
q > 4/3) it is easily verified from this formula that the 
sectional curvature can be both positive and negative. 



gr ~ Qa 
Q[ct,t] 



(A4) 



Note that we use q a to refer to the value of the penalty for 
the generalized Pauli matrix a, i.e., q„ = 1 if a has weight 
zero, one or two, and otherwise q a = q. The notation 
q[a, T ] is the value of the penalty for the generalized Pauli 
matrix proportional to [a, r] ; in the trivial case when a 
and r commute, we arbitrarily assign q[a, T ] = 1- We use 
a similar convention for expressions like c\ p ^ T . 

Substituting Equation (|A3|) into Equation (|A2D , we ob- 
tain 



R 



-C[p,<r],T(i[i[P, 



(A5) 



We use this as our basic expression for the curvature ten- 
sor, and derive other curvature quantities starting from 
this point. In doing so we will often find it helpful to 
use the observation that (er, r) = q a 5 aT . Note that this 
expression has several symmetries in addition to those 
satisfied in general by the curvature tensor. In partic- 
ular, it is easy to verify that to have R parp / we 
must have parp proportional to the identity, and it must 
be possible to partition the indices into two pairs, e.g., 
(p, a) and (r, /i), such that: (1) the pairs commute, i.e., 
[p, a] = [r, p] = 0; and (2) all other pairs anticommutc, 
i.e., [p, t] + = [p, p\ + = . . . = 0. Even when these condi- 
tions hold, individual terms in the expression (|A5[) may 
still vanish, e.g., the first term vanishes if p and r com- 
mute, or if a and p commute. 



2. Sectional curvature 

The sectional curvature in the tangent plane spanned 
by orthonormal right-invariant vector fields X and Y is 
defined by 



K(X,Y) = R(X,Y,Y,X). 



(A6) 



Define a bilinear operation B(X, Y) = ^(^(X), Y] 
Observe that we have the identities (X, i [Y, Z] ) 



Ricci tensor 



The Ricci tensor, Rc aTl is defined as the contraction 
of the raised form of the curvature tensor, R paT ^ , on the 
first and last indices, 



RCar 



R 9 



gWR 



pCFTfl ■ 



(A8) 
(A9) 



16|) that g pfl is nonzero only 
■po-Tfi vanishes unless parp oc /, 



the components 



Observing from Equation 
when p = p, and that R 
we see that Rc aT must be diagonal, i.e 
vanish unless a = r. 

To compute the diagonal entries in the Ricci tensor, 
we observe from Equation (|16p that the diagonal entries 
of the metric g pT are equal to l/q p . We thus obtain (no 
implied sum on repeated indices) 



RCaa = ^2 



Ra 



(A10) 



Using the expression of Equation (| A5|) . the definition 
Equation (|A4j) . and the observations (i[p, a], i\p, a]) = 
^Q[p,tr] anc i = — 4q p , we obtain after some 

algebra 



Rc aa = ( 2 + — 



2q P q<7 



3q? 



2Q[p,a]<lo 



1p1[p,o 



(All) 



where the prime indicates that the sum is only over p 
which anticommute with a. This sum may be further 
simplified by observing that up to proportionality fac- 
tors p and [p, a] range over the same set of matrices, i.e., 
generalized Pauli matrices which anticommute with a. 
Using this fact a change of variables may be used to show 
that the sum of the -2g p q CT / 'q p q[ p ,a] and 2q[ Pja ]q (T / 'q p q[p, a ] 
terms cancel. For similar reasons, the q 2 p / q P q[p,a] an d 
— ^1[p a}l 1p1[p,"] terms partially cancel. Finally, provided 
a ^ /, a simple counting argument shows that the num- 
ber of p which anticommute with a is 4"/2. Combining 
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all these observations, we obtain 



Rc a 



' 2 n 2 



(A12) 



To evaluate this more explicitly, let us define N a (V, V) 
to be the number of generalized Pauli matrices p such 
that: (1) p anticommutes with a; (2) p is in V, i.e., has 
only one- or two-body terms; and (3) [p, a] is also in V . 
We make analogous definitions for N a (V, Q), N a (Q, V) 
and N a (Q, Q). Expressed in these terms we have: 

Rc aa = 4" + q 2 a \N a (V, V) + ^N a (V, Q) 

+ -N a (Q,P) + \N a (Q 1 Q) 
q q A 

1 



(A13) 



-2 [N a {V,V) + -N a {V,Q) 
1 



+ qN<r(Q,V) + N a (V,V) 



Elementary counting allows us to evaluate the factors 
N a (-, •). It is most convenient to consider separately the 
cases where the weight w of a is 1,2,3 and 4 or more. 
The corresponding values for Rc aa are 

/4™ \ 1 

wt(o-) = 1 : Rc aa = 2(3n - 2) + ( — - 2(3n - 2) J — 

(A14) 

wt(cr) = 2 : Rc aa = -24(n - 2)q + 8(6n - 11) 
4" 

T 



4™ \ 1 

3(3n-5)J^ (A15) 



4™ 

wt(cr) = 3 : Rc aa = 12q 2 + — + 36(n - 3) 

-12(3n-8)- (A16) 
4™ 

wt(er) = w > 4 : Rc aa = — + 4w(3n - 2w) 

-4w(3n-2w)-. (A17) 
9 

The Ricci flow: In Section [V] we study the way 
geodesies deform when the metric is smoothly changed. 
A well-known method for changing the metric is the 
Ricci flow introduced by Hamilton [3(j and recently 
used by Perelman [3l|, [13, HI] in the resolution of the 
Poincare conjecture. The normalized Ricci flow is an 
equation for the metric tensor defined in components by 
dg aT /ds = -2R aT + 2Rg aT /{A n - 1). This equation de- 
fines a smooth family g s of metrics on the manifold. Al- 
though we do not seriously study the Ricci flow in this 
paper, we now briefly digress to note some interesting 
properties of the behaviour of the standard metric un- 
der the Ricci flow. Our numerical investigations suggest 
that the normalized Ricci flow takes the standard metric 
to the bi- invariant metric with q = 1, up to an overall 



scaling factor. This is interesting, and deserves further 
study, for the geodesies of the bi-invariant metric are well 
understood. 

To understand the normalized Ricci flow, observe from 
Equation (|A12|) that if q a depends only on the weight of 
a, then the resulting diagonal entries Rc aa in the Ricci 
tensor depend only on the weight of a. As a result, under 
the Ricci flow we can assume that the metric tensor is 
always diagonal with entries that depend only on the 
weight. 

To obtain an explicit expression for the metric under 
the Ricci flow, we define N a (v, w) to be the number of p 
with weight v such that the commutator [a, p] is nonva- 
nishing with weight w. Equation (|A12|) can be rewritten 



RCn 



(A18) 



where q v is the penalty for Pauli matrices of weight v. 
To find a simple formula for N a (v,w), we observe that 
N„(v,w) — 3 V (™)p(ui|er, v), where p(w\a, v) is the condi- 
tional probability that a random Pauli of weight v will 
commute with a to give a Pauli of weight w. This prob- 
ability can be computed by conditioning on the size of 
the overlap between a and the Pauli of weight v. The 
probability is zero unless wt(<r) -I- v — w is an odd and 
positive number. If that is the case then 



2 Wt(ff)+v-tu ^ \ 3 J \ k 
k 



4\ /wt(er)\ (n — wt(er) 
; - k 

i , , ] > ( A19 ) 

Wt(<7) + V — W — K ' 

where k runs over the possible sizes of the overlap region. 
A similar calculation can be done to obtain an expression 
for the scalar curvature. These provide elegant expres- 
sions for the Ricci tensor and scalar curvature in the cases 
when the metric is diagonal with entries depending only 
on the weight, and are useful in numerically simulating 
the normalized Ricci flow. 



4. Scalar curvature 

Returning to the study of the standard metric, the 
scalar curvature is obtained from the Ricci tensor via 
the contraction R = Rcf = Rc aa /q a . Using Equa- 
tions (IA14[) - (|A17|) we obtain 



R = -54n(n- l)(n - 2)g + 6n(36n 2 
3n(3n- 1)^ 4" 

1 



99?i + 64) 



4" - 1 



- 6n(45n 2 - 117n + 74) 
[3n(3n- 1)4™- 1 



9 



1 



6n(3n-4)(6n-7)l 

q A 
(A20) 
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For large n and fixed q the dominant terms in the scalar 
curvature are therefore 

lfi™ 1 1 

R ~ -54n 3 q + 216?i 3 + ^n 2 ^' 1 — . (A21) 

2 q q 2 

We see that provided q ~ 4™, the scalar curvature is nec- 
essarily negative. Remarkably, the proof in Q that Equa- 
tion ((5| holds also requires q ~ 4™ (or larger), and thus 
entails negative scalar curvature. Whether a relationship 
like Equation ([5]) can be proved for smaller values of q 
(and thus for positive scalar curvature) remains an open 
question. 

The scalar curvature can be shown to be the average 
of the sectional curvature, 



where /i(X, Y) is the normalized measure induced by our 
metric on the space of orthonormal X and Y. Note that 
the constant of proportionality out the front is 4" — 1 if 
we are working on U (2™), and is 4™ — 2 if we are working 
on SU(2 n ). This suggests (and Equation (|A7|) can be 
used to verify) that typical values of the sectional cur- 
vature are negative. It is well known that on manifolds 
with everywhere negative curvature, the dynamical sys- 
tem defined by the geodesic flow is ergodic and mixing; 
see Sections 10.5 and 10.6 of [l|| for an overview and ref- 
erences. This suggests the conjecture that such ergodic 
and mixing behaviour may be seen at least on parts of our 
manifold. If true, this may have interesting implications 
for quantum computation. 



R = (4" - 1) / d(i(X, Y)K(X, Y), (A22) 



